Goto

Collaborating Authors

 compression ratio


649adc59afdef2a8b9e943f94a04b02f-Paper.pdf

Neural Information Processing Systems

But these methods are unable to improve throughput (frames-per-second) on real-life hardware while simultaneously preserving robustness toadversarial perturbations.



Andrey Kuzmin, Markus Nagel, Mart van Baalen, Arash Behboodi, Tijmen Blankevoort Qualcomm AI Research

Neural Information Processing Systems

In this paper, we set out to answer the question on which is better: neural network quantization or pruning? By answering this question, we hope to inform design decisions made on neural network hardware going forward. We provide an extensive comparison between the two techniques for compressing deep neural networks.